04:00
2026-06-15
arxiv.org
machine-learning
Muon$^p$: Muon with Fractional Spectral Powers
Researchers introduced Muon$^p$, a new optimizer that uses fractional spectral-power updates to interpolate between Muon and gradient descent, improving finetuning performance on billion-scale models.โฆ